A Transposition Rule Analysis Based on a Particle Process 



David Gamarnik Petar Momcilovic* 

IBM T.J. Watson Research Center EECS Department 

Yorktown Heights, NY 10598 University of Michigan 

gamarnik@watson.ibm.com Ann Arbor, MI 48109 

petar@eecs.umich.edu 

February 1, 2008 



Abstract 

A hnear hst is a collection of items that can be accessed sequentially. The cost of 
a request is the number of items that need to be examined before the desired item is 
located, i.e., the distance of the requested item from the beginning of the list. The trans- 
position rule is one of the algorithms designed to reduce the search cost by organizing 
the list. In particular, upon a request for a given item, the item is transposed with the 
preceding one. We develop a new approach for analyzing the algorithm. The approach 
is based on a coupling with a certain constrained asymmetric exclusion process. This 
allows us to establish an asymptotic optimality of the rule for two families of request 
distributions. 

Keywords: self-organizing list, average-case analysis, exclusion process 

1 Introduction 

The linear list, a collection of items that can be accessed sequentially, is one of basic data 
structures known in computer science. A primary operation defined on the list is search. A 
requested item is found in the list by sequentially examining items from the beginning of 
the list one by one. The cost of search is defined to be the distance of the requested item 
from the beginning of the list, i.e., the number of items that need to be examined in order 
to locate the desired item. Intuitively, one would like to place frequently requested items at 
the front of the list so that to minimize the number of search steps. If the request sequence 
were known a priori, one could place items in an order that minimizes the search cost. Yet 
often properties of the request sequence are either not known in advance or time dependent. 
Hence, it is desirable to employ an algorithm that organizes the list based on past requests. 
The two best known self-organizing algorithms are the move-to-front rule and transposition 
rule IllJ Section 6]. In addition to being simple, these rules are memory- free, i.e., require 
no memory for their operation. 

List organizing algorithms have been analyzed over the past fifty years, e.g., see review 
on self-organizing linear search in 8^ . While the literature on the move-to- front rule (and the 
corresponding least-recently-used caching algorithm) is extensive (see, e.g., |^ IHl 171 1^ ITU 0] 
and references therein), the results on the transposition rule are scarce. Early analysis of 
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the transposition rule can be found in jl3j . In the same paper it was conjectured that 
the rule is optimal with respect to the expected value of the search cost. However, it was 
shown in ^ that this conjecture is not true in general. Except for the papers mentioned 
above, the probabilistic analysis of the transposition rule is either limited to the case of 
simplistic distributions jlUl I15j or numerical studies |21 El ■ The reader is referred to ^1] 
for a combinatorial (amortized) analysis of the transposition rule. 

In the present paper we develop a new approach for analyzing the transposition rule. 
The approach is based on a coupling with a constrained asymmetric exclusion process. 
This allows us to establish an asymptotic optimality of the rule for two families of request 
distributions. Specifically, we prove that the logarithm of the tail probability of the search 
cost is asymptotically optimal under the transposition rule when the request distribution is 
either power law or geometric. 

The rest of the paper is organized as follows. The model description and main results can 
be found in the next section. In Section |31 we describe an associated asymmetric exclusion 
process and characterize its stationary behavior. Section |1] relates the exclusion process to 
the transposition rule for self-organizing lists. Section El contains the proofs of the results 
stated in Section [21 Conclusions and some open questions are discussed in Section jBl 

2 Model and results 

We consider an infinite list of items L = {1, 2, . . . , A^, . . .} = N. At integer times t = 
0, 1, 2, . . . a request arrives for an item from L. The item requested at time t is denoted 
by R{t). The requests are independent and identically distributed, and vrj denotes the 
probability of item i being requested, Yli>i ~ Without loss of generality we assume 
that TTj > vTj+i for all i. Let R be equal in distribution to R{t), i.e., ¥[R = i] = Hi. 

The evolution of the list L is governed by the transposition rule. At time t = the 
list is assumed to be ordered as {1, 2, . . . , A^, . . .}. Upon every request, the requested object 
is moved forward by one position in the list while the object in front of it is moved one 
position back. If the first item in L is requested, the list does not change. The basic idea 
is that frequently requested items are moved closer to the beginning of the list over time; 
on the other hand, items with low request probabilities end up at some distance from the 
beginning of the list. 

At every time t the list is represented as some permutation o" : N — > N. Let Xi{t) 
be the position of the item i in the list at time t. Our focus is on the behavior of the 
position C{t) := Xj^^^-^it) of the requested item, i.e., the search cost, as t — > oo. We note 
that if permutation a is fixed, then the distribution of C{t) is determined completely by 
vr := {vTj}^^. In this case we use C"^ to denote the random position of the selected element 
R. Namely, is simply the (random) search cost required to locate the requested item in 
a given list order a. Thus, there exist two sources or randomness affecting the search cost: 
one corresponding to the random arrangement a of the items, and one corresponding to the 
randomness of the requested item R. 

Our first lemma is a simple observation stating that for every permutation a the tail 
asymptotics of C dominates the tail asymptotics of R. 

Lemma 1. For any distribution tt, permutation a, and for every x € N 

IP[C"' > x] > ¥[R > x]. 
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Proof. The definition of the search cost renders 



j: a{j)>x 

>J2^j=¥[R>x], 

j>x 

where the inequahty is holds by the monotonicity of elements of vr. □ 

Thus, as far as the tail probability asymptotics is concerned, no list ordering algorithm 
can achieve a better performance than the one under the optimal static arrangement. Note 
that arranging item in the decreasing order of tTj is feasible only if the distribution vr is 
known in advance. 

We say that R is distributed as a power law with parameter a > 1 if vTj = ci~" for all 
i, where c^^ = X^j>i is the normalization constant. Random variable R is defined to 
be asymptotically geometric with parameter < < 1 when i^^ logTTj logz^ as i ^ oo. 
The next result states that the transposition rule is asymptotically optimal with respect to 
the logarithm of the tail asymptotics for these two distribution families. 

Theorem 1. Let vr be either power law with parameter a > 1 or asymptotically geometric 
with parameter < u < 1. Then 

^ ^ logF|C(,) > .] ^ 

x^oot^oo logP[i?>x] 

Proof. See Section [SJ □ 

Often it is of interest to consider list that contain only a finite number of items, i.e., 
TT has a finite support. Although we will not make use of the following fact, we remark 
that for every distribution vr with finite support on {1,2,... ,N} (vrj = for all i > N) 
the described system is an irreducible, reversible, aperiodic Markov chain, and the unique 
stationary solution is of the following product form 



P[Xi =ii, X2=i2,---,XN =iN] 



E(fci,...,fcjv)6Pjvnj=i^j 



where P^ denotes the set of all permutations of list = {1, 2, . . . , A^}. A natural way to 
introduce a power law and geometric distribution for the case of finite support is to take 
the distribution vr conditioned on event {i < A}. Denote by vrTv the truncated distribution 
and let random variable Rn be defined by P[-R7v > x] := ¥[R > x\R < A]. Note that the 
existence of a unique stationary distribution for every A allows us to consider the stationary 
search cost denoted by Cat. 

Theorem 2. Let either (i) vr^v he truncated power law with parameter q > 1 and x/N < 7 
for some 7 < 1 or (ii) ttn be truncated asymptotically geometric with parameter < < 1 
and X < N. Then 

{x,N}^oo logPfi^AT > x] 

Proof. See Section |S1 □ 
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Figure 1: An example of evolution of the system with 4 particles. The arrows indicate 
intended movements of particles. Movements actually occur only in the first two instances. 

3 Constrained asymmetric exclusion process 



In this section we consider a certain constrained asymmetric exclusion process in which we 
examine the deviation of the boundary particle from its minimal position. In particular, 
we consider an n particle system on countably many slots on a half-line enumerated from 
left to right as 1, 2, . . .. Each particle is associated with an independent Poisson process of 
unit intensity (the actual rate is not important). At arrival instances of their correspond- 
ing Poisson processes particles move left or right with probabilities p and q, respectively. 
Multiple occupancies are not allowed and a move actually occurs only if the target slot is 
empty (see Fig. ^ for an example). A particle can not move left if it is located in the first 
slot. 

Assume that p > q and define /? := q/p < 1. Given that particles are enumerated from 
left to right with natural numbers, let Zi be the position (slot number) of the i-th particle. 
We first verify in a straightforward way that the stationary distribution is of the following 
form 

P [Zi = h, Z2 = t2,...,Zn = in] = Vn' (1) 

for all 1 < < ^2 < • • • < in, where is the normalization constant. To this end 



E 

l<il<i2<-<in 



En 



E E 



E'/=i 



1-/3 



< oo. 



and, therefore, the normalization constant ry„ is finite. It is easy to check that the under- 
lying Markov chain is irreducible, reversible, aperiodic and that satisfies the stationary 
equation. Hence, indeed describes the stationary distribution. 

We point out that the minimal possible value of Y17=i SILi ^ 



n(n + l)/2. 



Throughout the paper we interpret Yli 



= 1 for k < j. 



Lemma 2. The normalization constant satisfies 



Vn 



n{n + l) 



n 
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Proof. For each integer /c > let ijn^k denote the sum of ^ over the feasible choices 

of ij such that maxi<j<„{ij} < n + k. Then clearly rinfl < f?n,i < • • • and rjn^k Vn as 
k ^ oo. We claim that for every n, k 

The expression for r/„ follows immediately by taking the limit as /c ^ oo in ((21). 

The proof of (|2J) is by induction. It is trivial to validate that r/„^o = P ^ , n > 1, and 
Vi,k = Si^i ^ ^ 0, conform to 0. Next, we assume that ^ holds for r/jj for all i,j 
such that either i<n, j<kor:i<n, j<k and show that the statement is true for r]n,k- 
The quantity rjn,k satisfies the following equality 

r]n,k = r]n,k-i + P'^^''r]n-i,k; (3) 

the first term corresponds to the case max]^<j<n{^j} ^ ?i -t- /l, while the second one to the 
case maxi<j<„{ij} = n + k, i.e., the last (n-th) particle is in the slot n + k. From (jS)) and 
the inductive assumption one derives 

r^n,k = n + (5^-^^-^^ n 

i=l ^ i=l ^ 



k 



n(n + l) -p-|- 1 — j3 



nyn + L) -p-|- l - 



i=l 

This concludes the proof. □ 

Next, we use to examine the stationary deviation k„ of the last particle from its 
minimal position, i.e., := Z„ — n > 0. Expressions © and Lemma [21 yield 

n-1 

= Ha </3\ (4) 
i=i 

and, thus, 

P[k„ >i] < (5) 

Interestingly, this implies that there exists a limiting behavior for the case when the number 
of particles n grows to infinity. Indeed, as n — oo, the random variable k„ converges in 
distribution to a random variable k with distribution given by 

oo 

P[K=i]=/3^n(i-/3^+^-). 

From the preceding we conclude that k is asymptotically geometric with parameter (5 

lim P[k = i\l3~' = 1, 

and that random variable k is stochastically monotone in parameter (3. Finally, we note 
that for P < 1/2, or equivalently 2q < p, the most probable value of k is zero. 
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4 Coupling 



The following lemma relates the stationary (as t oo) properties of list L operating under 
the transposition rule and characteristics of the particle system studied in Section Let 
Ki{(3) explicitly denote the dependency of the random variable Kj on parameter /3 (see @) 
and set /? := vrj+i/vrj < 1. 

Proposition 1. For every x > i > 1 



lim 



\/x,(t) 



> X 



<F[Ki{p)+i > x] 



Remark 1. When the support of vr is finite there exists a unique stationary solution and, 
therefore, the left side of the preceding inequality converges as t — > oo. 

Proof. The proof is based on a coupling argument. We start by exploiting a Poisson embed- 
ding technique (see j6j for an application of the technique in the context of the move-to- front 
rule). The requests for item i, form a Poisson process of intensity vTj. Then, the limiting 
behaviors (as t ^ oo) of the original discrete-time system and the system with the Poisson 
request patterns are the same. 

Given a Poisson process (set of arrival times) A with rate A, let A{p) denote its subset, 
A{p) C A, formed by including each element of A in A(p) independently with probability p. 
Let Aj be the set of request times for item i G N. 

Next we construct a modified list L consisting of the same items as the original list L. 
Parameters of the new system are denoted with the "hat" symbol. Specifically, Xj{t) 
denotes the position of element j at time t in the list L. Each element j G L is associated 
with an independent Poisson process Aj defined as 



AjilTi/TTj), l<j<i, 

AjUA+, j>i, 



where A^ is an independent Poisson process with rate vTj+i — irj. Note that processes Aj 
are constructed in such a way that they are Poisson with rates vTj for 1 < j < i and vrj+i for 
j > i. In addition, observe that Aj C A-,- for 1 < j < i and Aj 5 Aj for j > i. Furthermore, 
let function ipj{t) be defined as follows 



(Pj{t) ■-- 



i.e., the function (pj{t) indicates whether item j is preceded by an item k such that rates 
of Aj and Afc differ. The request process A to list L is a superposition of Poisson processes 
A,: 

A(t):= U A,{t). (6) 

j: ipj(i) = l 

Item j is requested from L at time t = T if T G Aj fl A. In other words, item j < i is 
requested according to Aj only if it is preceded by an item in {i + 1, i + 2, . . . }. On the other 
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Figure 2: An example of reordering by operator TZi (in this case i = 4 and N = 8). The 
initial states of the lists are shown on the left. The modified list Lat is divided into two 
sublists: {1, 2, 3, 4} and {5, 6, 7, 8}. Upon reordering items in each of the sublists according 
to -Ljv, the new ordering in L^v is shown on the right. 



hand, requests for item j > i are placed according to Aj only if element j is preceded by an 
item in {1, . . . Note that the set of Poisson processes included in A changes with the 
evolution of list L. In addition, the number of elements in the union in ^ is always finite 
and bounded from above by 2i. 

The modified list L operates under the transposition rule with one modification. Namely, 
after the transposition rule rearranges items in either of the lists {L or L), a reordering 
operator TZi is applied to L. The operator works as follows. The list is divided in two 
sublist: {1,2,..., i} and {i + 1, z + 2, . . .}. The operator TZi reorders each sublist of L so 
that the order of elements within the sublists is the same as in the original list L. However, 
only items belonging to the same sublist are allowed to exchange positions in the list. An 
example of how TZi operates is shown in Fig. [5J 

Next, assume that both lists are in the same permutation at time t = 0. Then we argue 
that 



Lemma 3. For every t G 



and 1 < j < i 

Xj{t)<X,{t). 



(7) 



Proof. Either list changes only at times of requests to one of the systems; denote those 
times by < Ti < T2 < • • • < T„, < • • • . Since there are no changes in item order between 
times {T„}, it is sufficient to prove that holds for t = T„+, n > 1. To this end, suppose 
that ((TJ holds for t = T„_i+ and consider the two lists at time t = T„+. There are three 
cases that need to be examined: 

(i) At time t = Tn item j < i is requested from L. By the construction of A, this event 
implies that item j is requested in the modified list L only with some probability 
depending on the state of L. 

If item j is not requested from L, then the set of positions occupied by items {1, ■ ■ ■ ,i} 
in L remains the same. On the other hand, the set of positions occupied by items 
{1, . . . , i} in L either does not change (when an item in {1, ... ,i} precedes item j or 
j is the first item in the list) or is pairwise smaller (when an item in {i + 1, . . . , N} 
precedes item j). This, in conjunction with the fact that the relative order of items 
1, . . . ,i in L and L at t = Tn+ is the same (due to TZi), implies that holds for 
t = Tn+. 

On the other hand, if item j is requested in both lists, then the only case that needs 
to be examined in detail is the one in which j is preceded by item I < i in L; note that 
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from © it follows that j is preceded by an item v > i in L. Since X/(r„— ) < X/(T„— ) 
and the order of the items 1, . . . ,i is the same in both lists (due to TZi) one has 

Xj{Tn-) + l<Xj{Tn-), (8) 

The fact that items j and / are transposed in L but not in L yields that the order of 
j an / is different in the two lists before TZi is applied. Thus, TZi exchanges positions 
of items j and / in L. This leads to 

Xj{Tn+) = XliTn-) > Xi{Tn-) = Xj{Tn+) 

and 

Xl{Tn + ) = Xj{Tn-) - 1 > Xj{Tn-) = Xi{Tn + ), 

where the first inequality follows from the inductive assumption and the second in- 
equality is due to (|HI). Thus, we conclude that ((T)) holds for t = T„+. 

(ii) At time t = Tn item j > i is requested from L. The argument is very similar to the 
one in (i). In this case j is preceded by some v < i in L (see ©)• 

If item j is not requested from L or is preceded (in L) by an item v > i, then the 
positions occupied by items 1, . . . ,z in L do not change. However, in L item j must 
be preceded hy v < i and, thus, v is moved one position back by the transposition 
rule. Alternatively, if in L item j is requested and preceded by / < i then either v = I 
and Xi{Tn—) = X^(T„— ) or v ^ I and 

XliTn-) < XliTn-) + I, 
X^Tn-) < MTn-) + 1. 

In either case after the items are transposed one has Xi(T„+) < X;(T„+) and 

X^iTn + ) < Xv{Tn+)- 

(iii) At time t = Tn item j > i requested from L but not from L. This implies that j is 
preceded in L by an item v > i (see ©)• If J is preceded in L by an item / > i 
the positions occupied by items 1, . . . ,i do not change in either list. Therefore, we 
only need to consider the case when j is preceded by / < i. However, in that case we 
necessarily have 

XliTn-) > XliTn-) + 1, 

which implies 

XiiTn+) = XliTn-) + 1 < XliTn-) = XiiTn + ). 

The preceding (i)-(iii) establish 0. □ 

Next, let variables Zjit), 1 < j < i, be defined by 

(Zi(t),...,Z,(t)) ■.= SiXiit),...,Xiit)), 

where S is the sorting (in the increasing order) operator. Note that the definition of Ziit) 
and Q imply 

i 

Ziit)>\/ Xjit). (9) 
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Observe that the evolution of {Zj{t)} is probabilistically the same as in the constrained 
particle system described in Section |31 (recall that Zj{t) denotes the position of the jth 
particle at time t in Sectional) with p = TTi/{TTi + 7rj_|_i) and q = 7ri+i/(7ri + tTj+i). Indeed, 
Zj{t) increases by one at Poisson rate vTj+i only if Zj+i{t) ^ Zj{t) + 1, and it decreases by 
one at rate vTj only if Zj_i ^ Zj{t) — 1. 

Taking maximums on both sides of @ and applying operator P [ • > x ] leads to 



\/x,it) 



> X 



<F[Zi{t)>x]. 



From the preceding inequality one obtains 



lim ] 

t— >oo 



\/X,it)>x 



< 



\Zi > x] 



= F[ki{p)+i> x], 

where the last equality follows from the definition of in Section|21and (3 = TTj+i/vrj 



□ 



5 Proofs 

Proposition ^ is the primary tool in establishing our results on the performance of the 
transposition rule. The following lemma is a simple consequence of Proposition^ 

Lemma 4. For any y > 1 and distribution of requests it 

lE^ P[C7(t) > x] < F[Ky{TTy+i/TTy) > X - y] + F[R > y , R + kr{'Kr+i/ttr) > x]. 



Proof. Conditioning on the requested item and using the monotonicity of the max-operator 
result in 



lim F[C{t) > x]= lim V7riP[Xi(t) > x] 



< 



ETTj lim 
t^oo 



i=l 



\JXj{t)>x 



\J Xj{t) > X + TTj lim 

J=l J i>y+l 

< F[Ky{7ry+i/Try) > X - y] + F [R > y , R + tiRi-KR+i/TTR) > x], 
where the last inequality follows from Proposition ^ 

At this point we present the proofs of Theorems ^ and [2 



□ 



Proof of Theorem^ The lower bound is an immediate consequence of Lemma ^ and holds 
for any distribution of requests vr. Hence, we only consider the upper bound. 
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We first examine the case when tt is asymptotically geometric with parameter v. Fix 
arbitrary small e > such that u + e < 1. By the assumption, there exists such that 
v — e < vTj^i/vrj < h' + e for all i > z^. For any > u + e, setting y = in Lemma |3] yields 

F[C{t) >x]< F[Ki^ (u + e) > F[R + kr{u + e) > x] 

t—l-OO 

< s-^s^-'Es''»-('^+=) + s-^Es-^+''«(''+^), (10) 

where the last step is due to Markov's inequality. From > v + e and © it follows 
^gR+KR{v+e) ^ rpi^ig bound, ® and (UHl) result in 

x'^ Ih^I logP[C(t) > x] < -logs + x^Mog fs*=Es'''-(''+=) +Es^+''«(''+") 

^-logs, (11) 
as X — > oo. On the other hand, note that 

¥[R > x] y^TTi^iiy-ey-'^ 

i>x 

implying 



lim x~MogP[i2 > x] > log(z^ - e). (12) 

Combining Hll|) and ()12() . we obtain 

^ — . — logP[C(t) > xl logs^i 
lim lim — — — -— ^ < 



x^oot^oo logP[i?>x] log(i^ — e) ' 

passing s^^ i ly + £ and then letting e J. yield the result. 

Next, we consider the case when tt is power law with parameter a > 1. Lemma|l]and © 
yield 

Ik^^ F[C{t) > X]< F[Ky{TTy+l/TTy) > X " 7/ ] + P [ij > 7/ ] 



<(1-^] Y^Y +F[R>y]. 



— 1 / \ x—y 

(^) 

Letting y = \ex / log x] for a sufficiently small e > results in an estimate on the two terms 
in the preceding sum, as x — > oo, 

1 _ %±iy' ( !hi±iy = a-'e-yx--/^l + o{l)) 



and 

F[R>y]=^ci-" = 0{y'-+'). 

i>y 

Therefore, as x — > oo, 

Ih^ F[C{t) >x]< F[R > y](l + o(l)). 

t^oo 

The preceding equation together with the fact that tt is a power law yield the statement of 
the statement of the theorem. □ 
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Proof of Theorem\^ The proof is very similar to the one of Theorem ^ and, thus, we omit 
details. Since 

logP[Cjv > x] _ log F[Cjv > x] log¥[R > x] 
logF[RN > x] ~ logP[i? > x] logF[RN > x] 

and the upper bound on P[Civ > x] is the same as the one on limP[C(t) > x], we only 
need to verify that the last fraction in the preceding equality tends to one as {x, N} oo. 
However, that easily follows from the assumptions of the theorem. □ 

6 Concluding remarks 

We presented an analysis of the transposition rule based on a coupling with a constrained 
exclusion process. As an outcome, we established an asymptotic optimality of the transpo- 
sition rule in linear lists. Specifically, when the probability distribution of the requests is 
power law or geometric we showed that, under the transposition rule, the logarithm of the 
tail probability of the search cost is asymptotically optimal. 

While the steady-state distribution of the search cost is a primary quantity of interest, 
rates of convergence play an important role in assessing the applicability of self-organizing 
algorithms in practice. The proposed coupling may offer new directions for understanding 
these rates under the transposition rule. The same question, for the related move-to-front 
algorithm, was investigated in 0. As pointed out in j^j, the transposition rule is expected 
to have slower rates of convergence than the move-to-front rule. 
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